Automatic Acronym Identification and the Creation of an Acronym Database
Abstract
Over the past few decades there has been an explosion in the number of online technical documents, news stories and other formal documents. This growth has brought a correspondingly large increase in the number of acronyms defined. As a result, there is now a need for systems that can automatically identify acronyms and their meanings, to help users understand technical documents. The aim of this project is to design and implement a system capable of identifying acronyms in free text and inferring their meanings. The resulting database of acronyms should be available for querying online.
Similar resources
Efficient Acronym-Expansion Matching for Automatic Acronym Acquisition
Acronyms are a very dynamic area of many languages. An efficient dynamic programming algorithm for matching acronyms with their expansions by maximizing a linguistic plausibility score is presented and is found to be very accurate, to =99.6% on a corpus of acronym definitions. Given its high precision, the algorithm can be used as a component in new or existing automatic acronym acquisition sys...
ADAM: another database of abbreviations in MEDLINE
MOTIVATION: Abbreviations are an important type of terminology in the biomedical domain. Although several groups have already created databases of biomedical abbreviations, these are either not public, or are not comprehensive, or focus exclusively on acronym-type abbreviations. We have created another abbreviation database, ADAM, which covers commonly used abbreviations and their definitions (o...
Extraction and Disambiguation of Acronym Meaning-Pairs in Medline
Acronyms are widely used in biomedical and other technical texts. Understanding their meaning constitutes an important problem in the automatic extraction and mining of information from text. Moreover, an even harder problem is sense disambiguation of acronyms; that is, where a single acronym, termed a polynym, has a multiplicity of meanings, a common occurrence in the biomedical literature. In...
Automatic Acronym Recognition
This paper deals with the problem of recognizing and extracting acronym-definition pairs in Swedish medical texts. This project applies a rule-based method to solve the acronym recognition task and compares and evaluates the results of different machine learning algorithms on the same task. The method proposed is based on the approach that acronym-definition pairs follow a set of patterns and ot...
Automatic Extraction of Acronym-meaning Pairs from MEDLINE Databases
Acronyms are widely used in biomedical and other technical texts. Understanding their meaning constitutes an important problem in the automatic extraction and mining of information from text. Here we present a system called ACROMED that is part of a set of Information Extraction tools designed for processing and extracting information from abstracts in the Medline database. In this paper, we pr...